Learning mixed behaviours with parallel Q-learning

نویسندگان

  • Guillaume J. Laurent
  • Emmanuel Piat
چکیده

This paper presents a reinforcement learning algorithm based on a parallel approach of the Watkins’s Q-Learning. This algorithm is used to control a two axis micro-manipulator system. The aim is to learn complex behaviours as reaching target positions and avoiding obstacles at the same time. The simulations and the tests with the real manipulator show that this algorithm is able to learn simultaneously opposite behaviours and that it generates interesting action policies with regard to the global path optimization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hybrid Unconscious Search Algorithm for Mixed-model Assembly Line Balancing Problem with SDST, Parallel Workstation and Learning Effect

Due to the variety of products, simultaneous production of different models has an important role in production systems. Moreover, considering the realistic constraints in designing production lines attracted a lot of attentions in recent researches. Since the assembly line balancing problem is NP-hard, efficient methods are needed to solve this kind of problems. In this study, a new hybrid met...

متن کامل

Solving a New Multi-objective Unrelated Parallel Machines Scheduling Problem by Hybrid Teaching-learning Based Optimization

This paper considers a scheduling problem of a set of independent jobs on unrelated parallel machines (UPMs) that minimizesthe maximum completion time (i.e., makespan or ), maximum earliness ( ), and maximum tardiness ( ) simultaneously. Jobs have non-identical due dates, sequence-dependent setup times and machine-dependentprocessing times. A multi-objective mixed-integer linear programmi...

متن کامل

Farmer Behaviours and Sustainable Water Management in Semiarid Konya Closed Basin in Turkey

Objective: This study aims to review group learning method effect compared to individual learning method on dyslexic students of second grade in elementary school and it evaluates whether their problem will be solved in group and by other`s help? Thus, two methods of learning- Jigsaw I and Jigsaw II methods -were used to review their effects on improving learning and reading of...

متن کامل

Farmer Behaviours and Sustainable Water Management in Semiarid Konya Closed Basin in Turkey

Objective: This study aims to review group learning method effect compared to individual learning method on dyslexic students of second grade in elementary school and it evaluates whether their problem will be solved in group and by other`s help? Thus, two methods of learning- Jigsaw I and Jigsaw II methods -were used to review their effects on improving learning and reading of...

متن کامل

A Hybrid and bio-inspired Architecture approach for self-configuring behaviours in Cognitive Agents

In this work, an hybrid, self-configurable, multilayered and evolutionary subsumption architecture for cognitive agents is developed. Each layer of the multilayered architecture is modeled by one different Machine Learning System (MLS) based on bio-inspired techniques such as Extended Classifier Systems (XCS), Artificial Immune Systems (AIS), Neuro Connectionist Q-Learning (NQL) and Learning Cl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002